Picture for Abhinav Bhatele

Abhinav Bhatele

Optimizing Agentic Language Model Inference via Speculative Tool Calls

Add code
Dec 17, 2025
Figure 1 for Optimizing Agentic Language Model Inference via Speculative Tool Calls
Figure 2 for Optimizing Agentic Language Model Inference via Speculative Tool Calls
Figure 3 for Optimizing Agentic Language Model Inference via Speculative Tool Calls
Figure 4 for Optimizing Agentic Language Model Inference via Speculative Tool Calls
Viaarxiv icon

LLM Inference Beyond a Single Node: From Bottlenecks to Mitigations with Fast All-Reduce Communication

Add code
Nov 13, 2025
Figure 1 for LLM Inference Beyond a Single Node: From Bottlenecks to Mitigations with Fast All-Reduce Communication
Figure 2 for LLM Inference Beyond a Single Node: From Bottlenecks to Mitigations with Fast All-Reduce Communication
Figure 3 for LLM Inference Beyond a Single Node: From Bottlenecks to Mitigations with Fast All-Reduce Communication
Figure 4 for LLM Inference Beyond a Single Node: From Bottlenecks to Mitigations with Fast All-Reduce Communication
Viaarxiv icon

Modeling Code: Is Text All You Need?

Add code
Jul 15, 2025
Viaarxiv icon

Power Law Guided Dynamic Sifting for Efficient Attention

Add code
Jun 05, 2025
Viaarxiv icon

Zero-Shot Vision Encoder Grafting via LLM Surrogates

Add code
May 28, 2025
Viaarxiv icon

Plexus: Taming Billion-edge Graphs with 3D Parallel GNN Training

Add code
May 07, 2025
Viaarxiv icon

The Big Send-off: High Performance Collectives on GPU-based Supercomputers

Add code
Apr 25, 2025
Viaarxiv icon

Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers

Add code
Feb 12, 2025
Figure 1 for Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers
Figure 2 for Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers
Figure 3 for Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers
Figure 4 for Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers
Viaarxiv icon

Gemstones: A Model Suite for Multi-Faceted Scaling Laws

Add code
Feb 07, 2025
Figure 1 for Gemstones: A Model Suite for Multi-Faceted Scaling Laws
Figure 2 for Gemstones: A Model Suite for Multi-Faceted Scaling Laws
Figure 3 for Gemstones: A Model Suite for Multi-Faceted Scaling Laws
Figure 4 for Gemstones: A Model Suite for Multi-Faceted Scaling Laws
Viaarxiv icon

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Add code
Feb 07, 2025
Figure 1 for Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Figure 2 for Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Figure 3 for Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Figure 4 for Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Viaarxiv icon